Non-Negative Matrix Factorization Applied to Auditory Scenes Classification
نویسندگان
چکیده
This master's thesis is dedicated to the automatic classification of auditory scene using non-negative matrix factorization. A particular attention is paid to the performances achieved by the non-negative matrix factorization in sound sources detection. Our intuition was that a good classification could be achieve if we could efficiently detect the sources within auditory scenes. It appears on short artificial examples that taking into account the non-stationarity of the spectral content of the sound sources improves the source detection. Finally, our classification method is applied to a corpus of soundscapes of train stations and the results are compared with previous classifications methods. We finally conclude that using non-negative matrix factorization significantly improves the classification. Ce rapport de master est dédié à la classification automatique de scènes sonores utilisant la factorisation en matrices non-negatives. Une attention particulière est portée aux performances de la factorisation en matrices non-negatives dans le cadre de la détection de sources sonores. Notre intuition première a été qu'une classification performante pourrait être réalisée grâce une détection de sources efficace. Il s'est révélé sur de courts ex-emples artificiels que la prise en compte de la non-stationarité du contenu spectral des sources sonores améliore la détection de sources. Enfin, notre méthode de classification a été appliquée à un corpus de paysages sonores de gares et les résultats ont été comparé à d'autres méthodes de classification. Nous avons finalement conclu que l'utilisation de la factorisation en matrices non-négatives améliore significativement la classification. ii Acknowledgements
منابع مشابه
Iterative Weighted Non-smooth Non-negative Matrix Factorization for Face Recognition
Non-negative Matrix Factorization (NMF) is a part-based image representation method. It comes from the intuitive idea that entire face image can be constructed by combining several parts. In this paper, we propose a framework for face recognition by finding localized, part-based representations, denoted “Iterative weighted non-smooth non-negative matrix factorization” (IWNS-NMF). A new cost fun...
متن کاملVoice-based Age and Gender Recognition using Training Generative Sparse Model
Abstract: Gender recognition and age detection are important problems in telephone speech processing to investigate the identity of an individual using voice characteristics. In this paper a new gender and age recognition system is introduced based on generative incoherent models learned using sparse non-negative matrix factorization and atom correction post-processing method. Similar to genera...
متن کاملA Projected Alternating Least square Approach for Computation of Nonnegative Matrix Factorization
Nonnegative matrix factorization (NMF) is a common method in data mining that have been used in different applications as a dimension reduction, classification or clustering method. Methods in alternating least square (ALS) approach usually used to solve this non-convex minimization problem. At each step of ALS algorithms two convex least square problems should be solved, which causes high com...
متن کاملA new approach for building recommender system using non negative matrix factorization method
Nonnegative Matrix Factorization is a new approach to reduce data dimensions. In this method, by applying the nonnegativity of the matrix data, the matrix is decomposed into components that are more interrelated and divide the data into sections where the data in these sections have a specific relationship. In this paper, we use the nonnegative matrix factorization to decompose the user ratin...
متن کاملImproving LNMF Performance of Facial Expression Recognition via Significant Parts Extraction using Shapley Value
Nonnegative Matrix Factorization (NMF) algorithms have been utilized in a wide range of real applications. NMF is done by several researchers to its part based representation property especially in the facial expression recognition problem. It decomposes a face image into its essential parts (e.g. nose, lips, etc.) but in all previous attempts, it is neglected that all features achieved by NMF ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011